Nucleic Acids Research
◐ Oxford University Press (OUP)
Preprints posted in the last 90 days, ranked by how well they match Nucleic Acids Research's content profile, based on 1128 papers previously published here. The average preprint has a 0.81% match score for this journal, so anything above that is already an above-average fit.
Kases, K. J.; Marchante, P. G.; Witteveldt, J.; Peris, G.; Heras, S. R.; Macias, S.
Show abstract
Antiviral defence mechanisms are typically activated upon sensing virus-derived nucleic acids. During replication, viruses generate double-stranded RNA (dsRNA) intermediates that the innate immune system can sense, triggering several defence pathways. Conversely, mammalian cells avoid accumulating their own endogenous dsRNA to prevent activating these defence mechanisms. However, we demonstrate that mammalian embryonic stem cells (ESCs) accumulate endogenous dsRNA without activating these responses, as they lack all classical dsRNA-mediated antiviral pathways. To identify these endogenous dsRNAs, we have developed a method that includes an antibody-based purification coupled to RNase I treatment to enrich bona fide dsRNAs. The RNase I treatment results in an enrichment on sense/antisense and A-to-I edited transcripts, suggesting that they are true dsRNA in cells. Our refined protocol reveals that transposable elements (TEs), primarily the young elements from the LINE and LTR classes, are the predominant sources of dsRNA in ESCs. This approach will be useful for investigating the role of dsRNA in disease settings, such as autoimmunity or cancer, where endogenous dsRNA accumulation has also been observed.
Adiego-Perez, B.; Fluit, D.; Ludwig, C.; Berger, M.; Hohlbein, J.; Staals, R. H.; ten Wolde, P. R.; van der Oost, J.; Claassens, N. J.; Olivi, L.
Show abstract
Escherichia coli couples the initiation of DNA replication with cell size by modulating the activity of the replication initiator protein DnaA. The activity of DnaA is regulated by both its interconversion between an active and inactive form and its titration on binding sites on the chromosome. Whereas its interconversion has been thoroughly studied, the extent to which DnaA titration can control replication initiation is poorly understood. Here, we describe the control of E. coli DNA replication via titration by modulating the expression of an always-active DnaA variant in four growth conditions. While we obtained stable cell cycles during slow growth, faster growth associated with overlapping replication forks led to replicative instability and DNA damage. Overall, our results provide insights into the limits of titration-based systems in the control of genome replication and their potential role in the evolutionary trajectory of E. coli. Finally, this study provides design principles for a simplified, titration-only regulatory mechanism for DNA replication in synthetic cells.
Wolfram-Schauerte, M.; Trust, C.; Waffenschmidt, N.; Nieselt, K.
Show abstract
Time-resolved transcriptomic profiling has been used to study phage-host interactions for more than a decade. However, the resulting datasets are not readily accessible for custom re-analysis, and resources are lacking that provide standardized processing, storage, and analysis of transcriptomes from phage infections. Here, we present the PhageExpressionAtlas, the first bioinformatics resource for storing time-resolved dual RNA-sequencing data from phage infections. This data was processed uniformly using a custom analysis pipeline and is presented for interactive exploration through visualisation. The PhageExpressionAtlas currently hosts 42 datasets from 23 studies. Using the PhageExpressionAtlas, we replicate key findings from original publications and extend hypothesis testing across multiple phage-host systems. By systematically querying and analyzing the underlying database, we evaluate approaches to phage gene classification and show that uncharacterized phage genes are expressed across all infection phases. Moreover, we provide a comprehensive view of the expression dynamics of anti-phage defenses as well as host- and phage-encoded anti-defense systems in the infection context, indicating unique and conserved patterns of transcriptional regulation underlying bacterial anti-phage immunity and phage counter-strategies. Together, the PhageExpressionAtlas is a unifying resource that democratizes transcriptomics-driven analyses of phage-host interactions and supports integrative cross-study assessment.
Domingues-Silva, B.; Azzalin, C. M.
Show abstract
Mammalian telomeric DNA comprises long tracts of tandem TTAGGG repeats. The same repeats are also found at internal chromosomal regions called interstitial telomeric sequences (ITSs). Telomeres are transcribed into UUAGGG-containing transcripts, named TERRA, which serve multiple functions in maintaining telomere integrity. Complementary RNAs containing C-rich telomeric repeats, named ARIA, have also been identified in few yeast mutants and mammalian cells with dysfunctional telomeres. The molecular features and functions of ARIA remain understudied, mainly due to its low abundance and the lack of suitable cellular systems. Here, we show that Chinese hamster ovary (CHO) cells produce abundant TERRA and ARIA transcripts, predominantly originating from ITSs. Both RNAs are polyadenylated, exhibit relatively short half-lives and form large cellular foci. We also show that ARIA depletion leads to exposure of single-stranded (ss) DNA at ITSs and that ssDNA exposure increases when ITS DNA is damaged. SsDNA formation does not require the DNA damage signaling kinases ATM and ATR, nor the exonucleases DNA2 and EXO1; however, ATM prevents excessive ssDNA accumulation when ARIA function is inhibited. These findings establish CHO cells as a powerful model to dissect telomeric RNA functions and reveal ARIA as a key regulator of telomeric repeat DNA integrity.
Williams, J. D.; Tesmer, V. M.; Kannoly, S.; Shibuya, H.; Nandakumar, J.
Show abstract
Telomeres are nucleoprotein structures at the ends of eukaryotic chromosomes that safeguard them from triggering inappropriate DNA damage signaling. POT1, a member of the mammalian shelterin complex, binds single-stranded (ss) telomeric DNA and blocks the activation of the ATR kinase-mediated DNA damage response at telomeres. Yet until recently, it was poorly understood how the double-stranded (ds)-ss telomeric junction was protected from DNA damage response factors. An initial study of the DNA-binding activity of human POT1 (hPOT1) using systematic evolution of ligands by exponential enrichment (SELEX) and subsequent investigation revealed that POT1 contains a binding pocket, known as the POT-hole, that binds the 5 phosphorylated dC of the telomeric ds-ss junction. The amino acid residues composing the POT-hole show full sequence identity with telomeric proteins from diverse eukaryotes, including Caenorhabditis elegans POT-1. The current study builds on this SELEX method, developing an extensive analysis pipeline for SELEX datasets sequenced by next-generation sequencing and achieving a deeper analysis of the resulting sequences. We validated our approach by applying it to the DNA-binding domain of hPOT1, yielding results consistent with a previous SELEX study. Furthermore, we employ our pipeline to characterize the DNA-binding activity of C. elegans proteins that are considered homologs of hPOT1: POT-1, POT-2, POT-3, and MRT-1. Our analysis suggests that all four proteins show a binding preference for G-enriched DNA sequences, with POT-1 additionally binding secondary structural elements. Overall, we present a bioinformatics pipeline that is accessible and applicable for determining the nucleic acid-binding properties of a variety of proteins.
Warkentin, R.; Pyle, A. M.
Show abstract
Ribozyme-based permuted intron-exon (PIE) systems offer a protein-independent route to circRNA production, but existing platforms require elevated temperatures that promote RNA degradation. Here we report the first application of the Candida albicans mitochondrial large subunit (C.a.mtLSU) group I intron as a PIE platform for circRNA synthesis, which we term PCanPIE (Pyle lab Candida PIE). We evaluated three peripheral stems, P5, P6b, and P8, as permutation sites and demonstrated that all three support circularization under near-physiological conditions (25{degrees}C, 6 mM MgCl2), without the 55{degrees}C heating step required by existing PIE systems. Kinetic analysis revealed that permutation site does not affect the observed splicing rate constant but does influence PCanPIE folding and therefore influences circularization efficiency. The P6b permutation yielded the highest circularization efficiency, with 95 % of the precursor splicing to produce circRNA. Optimization of spacer sequences flanking the circRNA payload eliminated interference from structured native exon sequences and enabled efficient circularization of RNAs up to 1,657 nt, including structured, repetitive, and naturally occurring sequences. Together, these results establish PCanPIE as a versatile and near-physiologically active addition to the group I intron PIE toolkit.
Nguyen, K. T.; Lett, N. W.; Gravel, C. M.; Jo, S.; Shi, Y.; Narayan, M.; Sharma, S.; Sharma, C. M.; Berry, K. E.
Show abstract
Post-transcriptional gene regulation is a key mechanism for bacterial stress responses and virulence, and RNA-mediated regulation frequently relies on global RNA-binding proteins (RBPs). A pair of interacting KH-domain proteins (KhpA and KhpB) have recently been identified as global RBPs in several bacterial species. To better understand their molecular functions, we employed bacterial two- and three-hybrid (B2H/B3H) assays in an E. coli reporter system to analyze protein-protein and protein-RNA interactions of KhpA/B orthologs from three human pathogens: Campylobacter jejuni, Helicobacter pylori, and Clostridioides difficile. Protein-protein interactions were conserved across all species, with KhpA-KhpB heterodimers forming more robustly than either homodimer and KhpA homodimerizing more readily than KhpB. On the other hand, protein-RNA interactions were more varied across species: C. jejuni and C. difficile KhpA bound both species-specific and non-specific RNAs, but H. pylori KhpA--and KhpB orthologs from all species--showed no RNA interaction in B3H assays. Site-directed mutagenesis experiments demonstrated that residues in the GXXG motif of KhpA are critical for RNA interaction and differences in these residues account for the distinct RNA-binding behaviors of KhpA orthologs. Collectively, these findings provide a cross-species, molecular view of how KhpA and KhpB recognize one another and RNA ligands to regulate gene expression.
Tariq, K.; Polenkowski, M.; Quin, J.; Sugathan, A.; Isacson, S.; Jakobsson, S.; Enervald, E.; von Euler, A.; Öst, A.; Visa, N.; Östlund Farrants, A.-K.
Show abstract
The eukaryotic ribosomal genes are multi-copy genes, transcribed from the rDNA, and approximately one third of them is actively transcribed in differentiated cells. A number of lncRNAs have been identified from the intergenic spacer between the rRNA genes, among those the spacer RNA and PAPAS that are involved silencing of rRNA gene copies by altering the chromatin configuration. Here, we have identified lncRNAs that are transcribed from the human rDNA loci and modulate the loci; IGS38 positively regulates rRNA gene transcription by associating to the 47S rRNA gene promoter and modulating the rRNA promoter accessibility while IGS32as associates with heterochromatin. IGS38 binds to the 47S gene promoter through the RNA pol I factors TAF1C and RRN3 as well as the Williams Syndrome Transcription Factor (WSTF), a component of the B-WICH chromatin remodelling complex. The increased accessibility of the promoter stabilises the architectural protein Upstream Binding Factor (UBF) at the rRNA promoter, thereby facilitating RNA pol I promoter escape. Furthermore, IGS38 knock down displays and increased dsRNA abundance in the cytoplasm with a weak induction of the dsRNA sensor OAS2, typically induced by interferon and viral dsRNA. Overall, the both IGS38 and IGS32as are chromatin associated lncRNAs involved in rDNA chromatin changes, and IGS38 is stimulating, together with WSTF, rRNA gene transcription in human cells. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=199 HEIGHT=200 SRC="FIGDIR/small/722362v1_ufig1.gif" ALT="Figure 1"> View larger version (29K): org.highwire.dtl.DTLVardef@14d4159org.highwire.dtl.DTLVardef@fd773forg.highwire.dtl.DTLVardef@a0030dorg.highwire.dtl.DTLVardef@1285301_HPS_FORMAT_FIGEXP M_FIG C_FIG IGS stabilises 47S rRNA transcription, disruption of IGS38 expression leads to the release of dsRNA in the cytoplasm and a weak immune activation of OAS2. Created by biorender (https://biorender.com/shortURL)
Robertson, N.; Mikolajczyk, J.; Garcia-Sandoval, A. C.; Helwak, A.; Major, M. L.; Emadali, A.; Tollervey, D.; Turowski, T. W.
Show abstract
Defining high-confidence RNA interaction sites for specific proteins is essential to understand RNA biology, but existing methods face trade-offs between specificity, sensitivity, and experimental accessibility. Here, we present fluorescent Cross-linking and analysis of cDNAs (fCRAC), a mammalian-cell optimized update to the CRAC protocol. In fCRAC, a fluorescent adaptor is used, in place of radiolabeling, to visualize RNA-protein complexes during gel-purification. fCRAC retains the tandem affinity purification and stringent, denaturing conditions of classical CRAC, enabling nucleotide-resolution mapping of protein:RNA interactions with high signal to noise ratio. We initially tested fCRAC using RPP25L, a component the RNase MRP and RNase P complexes. RPP25L almost exclusively bound to predicted, single sites in the RNA components (RMRP and RPPH1), showing excellent selectivity with nucleotide resolution. To support analysis of UV cross-linking data for more complex targets, we developed the trxtools package and example pipeline for standardized processing, quality control, and analysis of data from fCRAC and related methods. We include tailored strategies for repetitive RNA classes, such as tRNA and rRNA, which can be challenging to analyze using other approaches. We applied fCRAC and trxtools to define the RNA interactome of human CYCLON/CCDC86, a nuclear protein previously implicated in oncogenesis. This revealed specific interactions with rRNA, tRNA and ncRNAs involved in pre-rRNA and pre-tRNA processing. HighlightsO_LINucleotide-resolution definition of RNAs interacting with specific proteins, including rRNA and tRNA C_LIO_LIStringent denaturing purifications and robust visualization steps, with no requirement for radioactive labelling C_LIO_LITrxtools provides an integrated analysis pipeline with approaches for analyzing both single and multi-copy RNA species C_LI
SONNEVILLE, R.; EVRIN, C.; WRIGHT, J. E.; XIA, Y.; LABIB, K. P. M.
Show abstract
Eukaryotic cells regulate the assembly and activation of the essential DNA helicase at the heart of the chromosome replication machinery, to ensure that the chromosomes are copied just once per cell cycle. The Mcm10 protein is essential for helicase activation in budding yeast, but an equivalent role for MCM10 orthologues in animal cells has not been explored. Moreover, complete deletion of the mcm-10 gene is viable in the nematode Caenorhabditis elegans, suggesting the involvement of additional factors. Here we show that MCM-10 and a second factor called SLD-2 are recruited to chromatin after helicase assembly in the C. elegans early embryo and are jointly required for helicase activation. Moreover, deletion of the Mcm10 gene is viable in mouse embryonic stem cells, but causes synthetic lethality in the absence of RECQL4, which is the orthologue of SLD-2 in vertebrate species. Helicase activation is blocked in the combined absence of MCM10 and RECQL4, mirroring the situation in C. elegans. These findings indicate that metazoan helicase activation requires two conserved factors that are mutated in human disease syndromes.
Fiorentino, J.; Monti, M.; Armaos, A.; Vrachnos, D. M.; Di Rienzo, L.; Tartaglia, G. G.
Show abstract
RNA-binding proteins (RBPs) regulate essential aspects of RNA metabolism, yet accurately identifying RNA-binding domains (RBDs) and quantifying the impact of sequence variation on RNA-binding ability remain challenging. Here, we present HERCULES (Hybrid framEwoRk for RNA-binding domain loCalization and mUtation anaLysis using physicochemical and languagE modelS), a unified sequence-based framework for simultaneous RBD localization, global RNA-binding propensity prediction and mutation effect assessment. HERCULES integrates a fine-tuned protein language model with an explicit residue-level physicochemical module, combining global contextual representations with local mutation-sensitive descriptors. On an independent test set, the HERCULES global score discriminates RBPs from non-RBPs with an AUROC of 0.86. At residue resolution, HERCULES outperforms state-of-the-art sequence-based predictors in identifying canonical, non-canonical and putative RBDs across Pfam-annotated proteins. Using a curated dataset of experimentally validated RNA-binding-disrupting mutations, HERCULES correctly classifies 87% of deleterious variants, including single-amino acid substitutions. Evaluation on experimentally resolved protein-RNA complexes further demonstrates robust residue-level performance and improved generalization when contact annotations are augmented with AlphaFold3-predicted complexes. By unifying domain localization and mutation sensitivity within a single sequence-only framework, HERCULES provides a mechanistically interpretable approach for studying RNA-protein interactions. HERCULES is freely available at https://tools.tartaglialab.com/hercules and as an open-source Python package at https://github.com/tartaglialabIIT/hercules.git.
Tejedor, A. R.; Luengo-Marquez, J.; Iscar, J. O.; Garcia, J. R.; Ocana, A.; Collepardo-Guevara, R.; Gonzalez, P. L.; Espinosa, J. R.
Show abstract
RNA plays a central role in the formation and regulation of biomolecular condensates, yet a quantitative understanding of how RNA sequence, structure, and thermodynamics jointly determine phase behaviour, particularly in repeat expansion RNAs, remains incomplete. Here, we introduce RNA2PS, a sequence-specific RNA coarse-grained model for phase-separation that predicts RNA structure from sequence, achieving quantitative agreement for both single-stranded conformations and duplex helical geometry relative to crystallographic PDB structures. RNA2PS represents each nucleotide by two beads that separate the phosphate-ribose backbone from the base. This representation decouples electrostatic interactions from directional base pairing, while explicitly incorporating strand polarity (5' [->] 3') and local sequence context at the trimer level. Canonical and wobble base pairing are modelled through a multi-body potential with sequence-dependent coordination. Importantly, RNA2PS captures sequence-dependent duplex stability at the nearest-neighbour level and reproduces experimental melting temperatures across a diverse set of sequences. RNA2PS shows that phase separation of trinucleotide repeat RNAs is governed by transient inter-strand duplexes that form reversible cross-links. Competition between intra- and intermolecular base pairing regulates the density of labile RNA-RNA interactions, giving rise to strong sequence- and length-dependent differences in condensation that reproduce cellular RNA foci formation. Overall, RNA2PS provides a near-quantitative predictive framework that links sequence-encoded hybridization thermodynamics to mesoscale condensation of pure RNA sequences.
Whitford, W.; Musgrave, S. M.; Snell, R. G.; Jacobsen, J. C.
Show abstract
Variants affecting RNA splicing are a major contributor to human disease, yet the consequences of variants outside of the canonical splice motifs are often difficult to determine. Here, we present a protocol for minigene-based evaluation of candidate splice-altering variants. The methodology described includes locus-specific insert design, commercial gene fragment synthesis, and long-read sequencing. The combined approach enables rapid assay development and nucleotide level resolution of the effect on splice isoforms in vitro, providing a scalable framework for functional validation of predicted cryptic splice variants. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=197 SRC="FIGDIR/small/723105v1_ufig1.gif" ALT="Figure 1"> View larger version (42K): org.highwire.dtl.DTLVardef@1a88cb5org.highwire.dtl.DTLVardef@adda98org.highwire.dtl.DTLVardef@1ea587corg.highwire.dtl.DTLVardef@574a63_HPS_FORMAT_FIGEXP M_FIG C_FIG
Karthik, A. S. P.; Das, A. B.
Show abstract
We developed a lightweight codon-based DNA Transformer equipped with multi-head self-attention and an adaptive classifier head, which achieves exon intron classification with high accuracy and also has moderate accuracy in CDS classification and splice site recognition. We named this model as ExIT (Exon-Intron Transformer). We have implemented codon tokenization for this model. This has been validated on the human genome with external validation from the chimpanzee genome. Further benchmarking has implied that our model is better than the existing models in the above tasks.
Ahmed, I.; Singh, A. P.; Chauhan, O. P.; Bhagat, K.; Gopinath, A.; Saikrishnan, K.
Show abstract
Type III restriction-modification (RM) enzymes are prominent bacterial defense against bacteriophage and invading foreign DNA that also modulate the hosts epigenetic landscape. Genome analysis of the host-adapted Mycoplasma bovis PG45 that has a very small genome revealed a Type III RM locus comprising one res and three mod genes. We characterized Mbo45V, a representative enzyme encoded by this locus. The enzyme forms a heterotrimeric complex consisting of two Mod subunits and one Res subunit. Mbo45V recognizes the asymmetric sequence 5'-YAATC-3' (Y = T/C) and cleaves DNA having at least two head-to-head oriented sites [~]26-28 bp away from the recognition site. Methylation of the second adenine of the target site using cofactor S-adenosylmethionine (SAM) protects DNA from restriction, while the SAM analogue sinefungin enhances DNA binding and cleavage. Kinetic studies reveal that Mbo45V exhibits relatively weak DNA binding affinity and an unusually high Km for SAM, indicating low cofactor affinity compared to prototypical enzymes such as EcoP15I. ATPase activity is strongly stimulated by cognate DNA and is inhibited upon methylation of the substrate, suggesting a regulatory interplay between methylation and restriction functions. Comparative analysis indicates that, although Mbo45V shares core mechanistic features with prototypes from Escherichia coli, its kinetic parameters are distinct. These differences likely reflect adaptation to the stable intracellular environment of M. bovis, in contrast to the fluctuating conditions encountered by the enteric bacteria.
Jaber, Q. Z.; Yu, N. J.; Masuda, I.; Hou, Y.-M.; Kleiner, R. E.
Show abstract
RNA modifications play an important role in biological processes. Mapping the diversity of RNA chemistry and studying the biological function of individual modifications remains an outstanding challenge in many organisms. In particular, RNA modifications remain poorly studied across most bacterial systems. Our group previously developed RNA-mediated activity-based protein profiling (RNABPP), a reactivity-based strategy employing metabolic labeling and quantitative proteomics to profile RNA modification writer enzymes in human cells. Here we adapt this approach to characterize RNA-modifying enzymes in bacteria. We apply metabolic labeling with 5-fluoropyrimidine nucleosides and phase separation-based enrichment of RNA-protein complexes (RNABPP-PS) to profile RNA pyrimidine modifying enzymes in E. coli and B. subtilis. We identify known and putative bacterial pyrimidine C5 methyltransferases, pseudouridine synthases, and dihydrouridine synthases, demonstrating the utility of our approach. Further, we find the carboxymethylaminomethyluridine (cnmn5U)-forming enzyme MnmG (GidA), supporting the existence of a covalent protein-RNA intermediate during the catalytic cycle. Finally, we use RNABPP-PS in B. subtilis to identify YfjO, an uncharacterized protein that is homologous to 5-methyluridine (m5U) methyltransferases. We use nucleoside and oligonucleotide mass spectrometry to establish that YfjO (which we rename as RlmS) installs m5U620 in the 23S rRNA (U576 in E. coli), a modification specific to the B. subtilis ribosome. We characterize {Delta}yfjO B. subtilis which has impaired cell proliferation and protein translation rate compared to WT. Taken together, our study establishes a versatile platform for RNA modifying enzyme discovery and characterization in bacteria and illuminates species-specific rRNA modification chemistry in B. subtilis.
Quadrini, M.; Tesei, L.
Show abstract
The ability to access, search, and analyse large collections of RNA molecules together with their secondary structure and evolutionary context is essential for comparative and phylogeny-driven studies. Although RNA secondary structure is known to be more conserved than primary sequence, no existing resource systematically associates individual RNA molecules with curated phylogenetic classifications. Here, we introduce PhyloRNA, a curated meta-database that provides large-scale access to RNA secondary structures collected from public resources or derived from experimentally resolved 3D structures. PhyloRNA allows users to search, select, and download extensive sets of RNA molecules in multiple textual formats, each entry being explicitly linked to phylogenetic annotations derived from five curated taxonomy systems. In addition to taxonomic information, each RNA molecule is accompanied by a rich set of descriptors, including pseudoknot order, genus, and three levels of structural abstraction--Core, Core Plus, and Shape--which facilitate comparative analyses across sets of molecules. PhyloRNA is publicly available at https://bdslab.unicam.it/phylorna/ and is regularly updated to incorporate newly available data and revised taxonomic annotations.
Mathis de Fromont, J.; Brosse, A.; Quenette, F.; Guillier, M.
Show abstract
Small regulatory RNAs (sRNAs) are major post-transcriptional regulators in bacteria and, together with transcriptional regulators such as the two-component systems (TCSs), participate in the rapid adaptation of these microorganisms to changing environments. Several examples of paralogous sRNAs with overlapping functions have been reported, that could in theory integrate different environmental cues. Consistent with this idea, we have identified the acid-responsive RstB-RstA two-component system, important for virulence of multiple bacterial species, as a specific multicopy activator of the Escherichia coli OmrB sRNA, but not of the paralogous sRNA OmrA. Further characterization of this regulation unexpectedly revealed the asr-ydgU operon, itself a target of RstB-RstA, as a dual modulator of this TCS via two opposite effects. First, the 27 aminoacids YdgU small protein exerts a negative feedback by directly interacting with RstB and, second, Asr in contrast mediates a positive feedback on RstB-RstA activity via a not completely elucidated mechanism. These results provide a new example of retro-control of a TCS, here RstB-RstA, by one of its direct targets. They further highlight the major role of small proteins in controlling TCS activity and ydgU was thus renamed samT, for Small Acid-responsive Modulator of the RstB-RstA TCS.
Roberson, A. B.; Marks, J.; Pitts, R.; Tamilselvam, B.; Grieb, B.; Tansey, W. P.; Meydan, S.
Show abstract
5-Azacytidine (5-AzaC) is a cytidine analog and is widely used to treat myelodysplastic syndromes (MDS) and acute myeloid leukemia (AML). Although its therapeutic activity is primarily attributed to hypomethylation resulting from DNA incorporation, the majority of 5-AzaC is incorporated into RNA. However, the functional consequences of 5-AzaC incorporation into RNA have been unknown. Here, we show that 5-AzaC treatment of cells leads to inhibition of protein synthesis. Ribo-seq, Disome-seq, and RNA-seq in cells treated with 5-AzaC exhibit a time-dependent C-to-G transversion signature in mRNAs within 2 h of treatment. These transversion events are enriched within footprint positions corresponding to the A-site of monosomes or leading stalled ribosome in a disome complex. Consistently, ribosome and disome footprints are accumulated at sites with C-rich codons in the A-site, specifically with the codons containing a C in the second position. 5-AzaC activates the integrated stress response (ISR) and the ribotoxic stress response (RSR) in a GCN2- and ZAK-dependent manner, consistent with disome-mediated signaling. Furthermore, loss of the Ribosome Quality Control (RQC) factor, ZNF598, sensitizes cells to 5-AzaC. Collectively, our results support a model where 5-AzaC is rapidly incorporated into mRNAs, disrupts decoding, and triggers disome-mediated signaling pathways, which contribute to its cytotoxicity. These findings suggest that translation disruption represents an additional layer of 5-AzaCs mechanism of action, alongside its known DNA-mediated effects.
Wang, D.; Jin, J.; Qiao, J.; Wei, L.; Wu, S.; Liu, Q.
Show abstract
Experimental and predicted RNA three-dimensional structures are expanding rapidly, but RNA structure search still lacks a compact residue-level representation that supports database-scale comparison. Using family-held-out ablations across the currently available experimental RNA structure collection, we found that spatial-neighbour features are markedly more informative for family-level discrimination than conventional backbone and base descriptors. Building on this result, we developed RiboSeek, a search framework based on a 20-letter geometric alphabet (RS-20), an 80-letter structure-and-base composite alphabet (RS-80). Across family-level classification and retrieval benchmarks, RS-80 delivered the strongest overall performance, whereas RS-20 most closely tracked US-align TM-score, indicating better preservation of geometric similarity. RiboSeek searches the full experimental RNA structure database in 204 ms per query and can be applied to predicted RNA structure libraries to prioritize candidate structural relationships for downstream analysis.